Picture for Mario Fritz

Mario Fritz

MATRA: Modeling the Attack Surface of Agentic AI Systems -- OpenClaw Case Study

Add code
May 11, 2026
Viaarxiv icon

The Alpha Blending Hypothesis: Compositing Shortcut in Deepfake Detection

Add code
May 11, 2026
Viaarxiv icon

Automated Detection of Abnormalities in Zebrafish Development

Add code
May 11, 2026
Viaarxiv icon

Trustworthy AI Suffers from Invariance Conflicts and Causality is The Solution

Add code
May 04, 2026
Viaarxiv icon

Self-Improving Tabular Language Models via Iterative Group Alignment

Add code
Apr 21, 2026
Viaarxiv icon

Inspectable AI for Science: A Research Object Approach to Generative AI Governance

Add code
Apr 13, 2026
Viaarxiv icon

Certified Circuits: Stability Guarantees for Mechanistic Circuits

Add code
Feb 26, 2026
Viaarxiv icon

Scalable Delphi: Large Language Models for Structured Risk Estimation

Add code
Feb 09, 2026
Viaarxiv icon

IV Co-Scientist: Multi-Agent LLM Framework for Causal Instrumental Variable Discovery

Add code
Feb 08, 2026
Viaarxiv icon

Funny or Persuasive, but Not Both: Evaluating Fine-Grained Multi-Concept Control in LLMs

Add code
Jan 26, 2026
Viaarxiv icon